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Abstract 



CN| I The A-calculus is a widely accepted computational model of higher-order functional pro- 

grams, yet there is not any direct and universally accepted cost model for it. As a consequence, 
^ ■ the computational difficulty of reducing A-terms to their normal form is typically studied by 

, reasoning on concrete implementation algorithms. In this paper, we show that when head 

reduction is the underlying dynamics, the unitary cost model is indeed invariant. This im- 
proves on known results, which only deal with weak (call-by- value or call-by-name) reduction. 
Invariance is proved by way of a linear calculus of explicit substitutions, which allows to nicely 
■ decompose any head reduction step in the A-calculus into more elementary substitution steps, 

thus making the combinatorics of head-reduction easier to reason about. The technique is 
also a promising tool to attack what we see as the main open problem, namely understanding 



^ ■ for which normalizing strategies derivation complexity is an invariant cost model, if any. 

1 Introduction 



Giving an estimate of the amount of time T needed to execute a program is a natural refinement of 
the termination problem, which only requires to decide whether T is either finite or infinite. The 
shift from termination to complexity analysis brings more informative outcomes at the price of an 
increased difficulty. In particular, complexity analysis depends much on the chosen computational 
model. Is it possible to express such estimates in a way which is independent from the specific 
I machine the program is run on? An answer to this question can be given following computational 

complexity, which classifies functions based on the amount of time (or space) they consume when 
executed by any abstract device endowed with a reasonable cost model, depending on the size of 
input. When can a cost model be considered reasonable? The answer lies in the so-called invariance 
I thesis [21]: any time cost model is reasonable if it is polynomially related to the (standard) one 

of Turing machines. 

If programs are expressed as rewrite systems (e.g. as first-order TRSs), an abstract but effective 
way to execute programs, rewriting itself, is always available. As a consequence, a natural time 
cost model turns out to be derivational complexity, namely the (maximum) number of rewrite 
steps which can possibly be performed from the given term. A rewriting step, however, may not 
be an atomic operation, so derivational complexity is not by definition invariant. For first-order 
TRSs, however, derivational complexity has been recently shown to be an invariant cost model, 
by way of term graph rewriting [S] . 

The case of A-calculus is definitely more delicate: if /3- reduction is weak, i.e., if it cannot 
take place in the scope of A-abstractions, one can see A-calculus as a TRS and get invariance by 
way of the already cited results [10] , or by other means [19] . But if one needs to reduce "under 
lambdas" because the final term needs to be in normal form (e.g., when performing type checking 
in dependent type theories), no invariance results arc known at the time of writing. 

In this paper we give a partial solution to this problem, by showing that the unitary cost 
model is indeed invariant for the A-calculus endowed with head reduction, in which reduction can 
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take place in the scope of A-abstractions, but can only be performed in head position. Our proof 
technique consists in implementing head reduction in a calculus of explicit substitutions. 

Explicit substitutions were introduced to close the gap between the theory of A-calculus and 
implementations [T]. Their rewriting theory has also been studied in depth, after McUics showed 
the possibility of pathological behaviors |14] . Starting from graphical syntaxes, a new at a dis- 
tance approach to explicit substitutions has recently been proposed [7]. The new formalisms are 
simpler than those of the earlier generation, and another thread of applications — to which this 
paper belongs — also started: new results on A-calculus have been proved by means of explicit 
substitutions [71[B]. 

In this paper we use the linear-substitution calculus A^.j, a slight variation over a calculus of 
explicit substitutions introduced by Robin Milner |15| . The variation is inspired by the structural 
A-calculus [7] . We study in detail the relation between A-calculus head reduction and linear head 
reduction [13j . the head reduction of A[.], and prove that the latter can be at most quadratically 
longer than the former. This is proved without any termination assumption, by a detailed rewriting 
analysis. 

To get the Invariancc Theorem, however, other ingredients are required: 

1. The Subterm Property. Linear head reduction has a property not enjoyed by head /3-reduction: 
linear substitutions along a reduction t —o* u duplicates subtcrms of t only. It easily follows 
that — o-steps can be simulated by Turing machines in time polynomial in the size of t and the 
length of —o*. This is explained in Section [31 

2. Compact representations. Explicit substitutions, decomposing /3-reduction into more atomic 
steps, allow to take advantage of sharing and thus provide compact representations of terms, 
avoiding the exponential blowups of term size happening in plain A-calculus. Is it reasonable 
to use these compact representations of A-terms? We answer affirmatively, by exhibiting a 
dynamic programming algorithm for checking equality of terms with explicit substitutions 
modulo unfolding, and proving it to work in polynomial time in the size of the involved compact 
representations. This is the topic of Section [5] 

3. Head simulation of Turing machines. We also provide the simulation of Turing machines by 
A-terms. We give a new encoding of Turing machines, since the known ones do not work with 
head /3-reduction, and prove it induces a polynomial overhead. Some details of the encoding 
are given in Section [HI 

We emphasize the result for head /3-reduction, but our technical detour also proves invariance for 
linear head reduction. To our knowledge, we are the firsts to use the fine granularity of explicit 
substitutions for complexity analysis. Many calculi with bounded complexity (e.g. [20]) use let- 
constructs, an avatar of explicit substitutions, but they do not take advantage of the refined 
dynamics, as they always use big-steps substitution rules. 

To conclude, we strongly believe that the main contribution of this paper lies in the technique 
rather than in the invariance result. Indeed, the main open problem in this area, namely the 
invariance of the unitary cost model for any normalizing strategy remains open but, as we argue 
in Section [4l seems now within reach. 

2 A-Calculus and Cost Models: an Informal Account 

Consider the pure, untyped, A-calculus. Terms can be variables, abstractions or applications and 
computation is captured by /3-reduction. Once a reduction strategy is fixed, one could be tempted 
to make time and reduction steps to correspond: firing a /3-redex requires one time instant (or, 
equivalently, a finite number of time instants) and thus the number of reduction steps to normal 
form could be seen as a measure of its time complexity. This would be very convenient, since 
reasoning on the complexity of normalization could be done this way directly on A-tcrms. However, 
doing so one could in principle risk to be too optimistic about the complexity of obtaining the 
normal form of a term t, given t as an input. This section will articulate on this issue by giving 
some examples and pointers to the relevant literature. 

Consider the sequence of A-terms defined as follows, by induction on a natural number n (where 
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u is the lambda term yxx): to = u and for every n G N, tn+i = {Xx.tn)u. tn has size linear in n, 
and tn rewrites to its normal form r„ in exactly n steps, following a leftmost-outermost strategy: 

to = u = ro 

ti yuu = yroyro = ri 

t2 {Xx.to){yuu) = {Xx.u){ri) yriri = r2 

For every n, however, r„+i contains two copies of r„, hence the size of r„ is exponential in n. 
As a consequence, if we stick to the leftmost-outermost strategy and if we insist on normal forms 
to be represented explicitly, without taking advantage of sharing, the unitary cost model is not 
invariant: in a linear number of /3-step we reach an object which cannot even be written down in 
polynomial time. 

One may wonder whether this problem is due to the specific, inefficient, adopted strategy. 
However, it is quite easy to rebuild a sequence of terms exhibiting the same behavior along an 
innermost strategy: if s = Xy.yxx, then define vq to be just Xx.s and for every n S N, Vn+i 
to be Xx.Vn-s. Actually, there are invariant cost-models for the A-calculus even if one wants to 
obtain the normal form in an explicit, linear format, like the difference cost model [5]. But they 
pay a price for that: they do not attribute a constant weight to each reduction step. Then, 
another natural question arises: is it that the gap between the unitary cost model and the real 
complexity of reducing terms is only due to a representation problem? In other words, could we 
take advantage of a shared representation of terms, even if only to encode A-terms (and normal 
forms in particular) in a compact way? 

The literature offers some positive answers to the question above. In particular, the unitary cost 
model can be proved to be invariant for both call-by-name and call-by-value A-calculi, as defined 
by Plotkin [T7j. In one way or another, the mentioned results are based on sharing subterms, cither 
by translating the A-calculus to a TRS [TU] or by going through abstract machines [H]. Plotkin's 
calculi, however, are endowed with weak notions of reduction, which prevent computation to 
happen in the scope of a A- abstraction. And the proposed approaches crucially rely on that. 

The question now becomes the following: is it possible to prove the invariance of the unitary 
cost model for some strong notion of reduction? This paper gives a first, positive answer to 
this question by proving the number of /3-reduction steps to be an invariant cost model for head 
reduction, in which one is allowed to reduce in the scope of A-abstraction, but evaluation stops 
on head normal forms. 

We arc convinced that the exponential blowup in the examples above is, indeed, only due to 
the A-calculus being a very inefficient representation formalism. Following this thesis we use terms 
with explicit substitutions as compact representations: our approach, in contrast to other ones, 
consists in using sharing (here under the form of explicit substitutions) only to obtain compactness, 
and not to design some sort of optimal strategy reducing shared rcdexes. Actually, we follow the 
opposite direction: the leftmost-outermost strategy — being standard — can be considered as the 
maximally non-sharing strategy. How much are we losing limiting ourselves to head reduction? 
Not so much: in Section [S] we show an encoding of Turing machines for which the normal form 
is reached by head-reduction only. Moreover, from a denotational semantics point of view head- 
normal forms — and not full normal forms — are the right notion of result for /3-reduction. 

The next two sections introduce explicit substitutions and prove that the length of their head 
strategy is polynomially related to the length of head /3-rcduction. In other words, the switch to 
compact representations does not affect the cost model in an essential way. 

3 Linear Explicit Substitutions 

First of all, we introduce the A-calculus. Its terms are given by the grammar: 

t,u,r eT\ :■■ X \TxTx\ Xx.Tx 
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and its reduction rule -^p is defined as the context closure of {Xx.t) u t{x/u\. We will mainly 
work with head reduction, instead of full /3-reduction. We define head reduction as follows. Let 
an head context H be defined by: 

H ::= [•] \HTx\ Xx.H 

Then define head reduction as the closure by head contexts of i— >-^. Our definition of head 
reduction is slightly more liberal than the usual one. Indeed, it is non-deterministic, for instance: 

{Xx.I) t h ^ (Aa;.(/ /)) t I I 

Usually only one of the two redexes would be considered an head redex. However, this non- 
determinism is harmless, since one easily proves that 

Lemma 1 — >h has the diamond property, namely ift Ui with i — 1,2 then there exists r such 
that Ui — >h r. 

Reducing — >h in an outermost way we recover the usual notion of head reduction, so our approach 
gains in generality without loosing any property of head reduction. Our notion is motivated by 
the corresponding notion of head reduction for explicit substitutions, which is easier to manage in 
this more general approach. 

The calculus of explicit substitutions we are going to use is a minor variation over a simple 
calculus introduced by Milner [T5|. The grammar is standard: 

t,u,r er-.-.x ITT \ Xx.t ITlx/T] 

The term t[x/M] is an explicit substitution. Both costructors Xx.t and i[a;/u] bind x in t. We note 
L a possibly empty list of explicit substitutions . . . [xk/uk\. Contexts are defined by: 

C, D, E, F :: [•] \ CT\T C\ Xx.C \ C[x/T] \ T[x/C] 

We note C[t] the usual operation of substituting [■] in C possibly capturing free variables of t. We 
will often use expressions like C[a;][a;/M] where it is implicitly assumed that C does not capture 
X. The linear- substitution calculus A[.] is given by the rewriting rules — ^-dB, ^^is and — ^-gc, defined 
as the context closure of the rules M-dB, n-is and t-^gc in Figure [1] We also use the notation 
— ?>A[ ]=->dB U U — T'gc and — >'s=— >is U ^>gc. Rule — >-dB acts at a distance: the function Xx.t 
and the argument u can interact even if there is L between them. This is motivated by the close 
relation between A[.] and graphical formalisms as proof- nets and A j -dags [21 [S], and is also the 
difference with Milner's presentation of A[.] [TS]. 

The linear-substitution calculus enjoys all properties required to explicit substitutions calculi, 
obtained by easy adaptations of the proofs for Aj in [7]. Moreover, it is confluent and preserves 
/3-strong normalization. In particular, — >-s is a strongly normalizing and confluent relation. 

Given a term t with explicit substitutions, its normal form with respect to — !>s is a A-tcrm, 
noted t\., called the unfolding of t and verifying the following equalities: 

{tu)i=tiui {Xx.t)i^ Xx.ti {t[xlu])i=ti{x/u\} 

Another useful property is the so-caWed full-composition, which states that any explicit substitution 
can be reduced to its implicit form independently from the other substitutions in the term, formally 
^[x/m] — t{x/u}. Last, A[.] simulates A-calculus (t — u implies t -^^i j ''^) ^^'^ reductions in A[.] 
can be projected on A-calculus via unfolding (t — >A[.] u implies ^4-^^ "4-)- 

The calculus A[.] has a strong relation with proof-nets and linear logic: it can be mapped to 
Danos' and Regnier's pure proof- nets [18] or to A j -dags [5]. The rule — J-dB corresponds to proof- 
nets multiplicative cut-elimination, — >-is to the cut-elimination rule between ! (every substitution 
is in a !-box) and contraction, — >gc to the cut-elimination rule between ! and weakening. The case 
of a cut between ! and dereliction is handled by — ^is , as if cut derelictions were always contracted 
with a weakening. 
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{Xx.t)Lu h->^B t[x/u]L 

C[x][x/u] i-^is C[u][a;/u] 

t[a;/u] H>gc t if X ^ 

Figure 1: A[.] rewriting rules 

3.1 Linear Head Reduction, the Subterm Property and Shallow Terms 

In this paper, we mainly deal with a specific notion of reduction for A[.], called linear head reduction 
[131 [12, and related to the representation of A-calculus into linear logic proof- nets. In order to 
define it we need the notion of head context for explicit substitutions. 

Definition 1 (iiead context) Head contexts are defined by the following grammar: 

H ::= [•] I HT\\x.H \ H[x/T]. 

The fundamental property of an head context H is that the hole cannot be duplicated nor erased. 
In terms of linear logic proof-nets, the head hole is is not contained in any box (since boxes are 
associated with the arguments of applications and with explicit substitutions). We now need to 
consider a modified version of i— >is: 

-ff [a;] [x/ii] —°is H[u\[x/u\ 

Now, let -^dB (resp. ^is) be the closure by head contexts of i— >dB (resp. ^is). Last, define 
head linear reduction — o as — OdB U ^is. Please notice that — o can reduce under A, for instance 
Xy.{H[x][x/u\) — Xy.{H[u][x/u]). Our definition of — o gives a non-deterministic strategy, but 
its non-determinism is again harmless: 

Lemma 2 — o has the diamond property. 
Proof. The only possible critical pair is given by 

i i 
H,[H2[x][y/v]L][x/u]^ Hi[H2[u][y/v]L][x/u] 

There is also the possibility that a ^-redex R is contained into another ^-redex R', but then R' 
cannot duplicate R: 



Hi [ {Xy.H2 [ {Xx.t)L2 u])Liv] -o Hi \H2 [ {Xx.t)L2 u ] [y/v]Li] 

I A 

H,fl^%m2[l[x/u\L2\)li ^ Hi[H2[t[x/u]L2][y/v]Li] 

There are no other cases, in particular no term can have two disjoint — o-redexes. □ 

A term w is a box-subterm of a term t (resp. of a context C) if t (resp. C) has a subterm of 
the form r u or of the form r[x/u] for some r. 

Remark 1 By definition of head- contexts, [■] is not a box-subterm of H[-], and there is no box- 
subterm of H[-] which has [■] as a subterm. 

Proposition 3 (Subterm Property) // t u and r is a box-suterm of u, then r is a box- 
subterm of t. 
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The aforementioned proposition is a key point in our study of cost models. Linear head substitution 
steps duphcate sub-terms, but the Subterm Property guarantees that only sub-terms of the initial 
term t are duplicated, and thus each step can be implemented in time polynomial in the size of 
<, which is the size of the input, the fundamental parameter for complexity analysis. This is in 
sharp contrast with what happens in the A-calculus, where the cost of a /3-reduction step is not 
even polynomially related to the size of the initial term. 

Proof. By induction on the length k of the reduction t — o* u. Suppose A: > 0. Then t — o* ij — o u 
and by i.h. any box-subtcrm of u is a box subterm of t, so it is enough to show that any box-subterm 
of u is a box-subterm of v. By induction on u — o u: 

• Base cases: 

• u = (Aa:.r)L s —o r[x/s]L = u: it is evident that the two terms have the same box-subterms. 

• V = H[x\[x/ s\ —o H[s][x/s] = u: by a previous remark the plug of s in H[-] does not create 
any new box-subterm, nor modify a box-subterm of H[-\. And obviously any box-subterm 
of s is a box-subterm of v. 

• Inductive cases: just use the i.h. and the previous remark. 

□ 

The subterm property does not only concern the cost of implementing reduction steps, but also 
the size of the end term: 

Corollary 4 There is a polynomial p : N x N — !> N such that if t -^^ u then \u\ < p{k, \t\). 

Consider a reduction t —o* u where t is a A-term. Another consequence of the Subterm Property 
is that for every explicit substitution occurring in u, the substituted term is a A-tcrm. This is 
another strong property to be used in the analysis of the next section. 

Definition 2 (Shallow Terms) A A[.]-ierm t is shallow if whenever t ~ C[u[x/r]\ then r is a 
X-term. 

Corollary 5 Let t be a X-term and t —q* u. Then u is shallow. 

Proof. By lemma [3] the content of any substitution of u is a subterm of t, i.e. a A-term. □ 

4 On the Relation Between A and A[.] 

In this section, linear explicit substitutions will be showed to be an efficient way to implement 
head reduction. We will proceed by proving three auxiliary results separately: 

1. We show that any ^-reduction p projects via unfolding to a — >h-reduction pj, having as length 
exactly the number of steps in p; this is the topic of Section 14.11 

2. We show the converse relation, i.e. that any — >h-reduction p can be simulated by a — o-reduction 
having as many — o^e-steps as the the steps in p, followed by unfolding; this is in Section 14.21 

3. We show that in any -^-reduction p the number of -^is-steps is 0{\pW^) where |p|dB is the 
number of — OdB steps in p. By the previous two points, there is a quadratic — and thus 
polynomial — relation between — ?>h-reductions and ^-reduction from a given term; all this is 
explained in Section l4?3l 

4.1 Projection of -<> on — j-h 

The first thing one needs to prove about head-linear reduction is whether it is a sound way to 
implement head reduction. This is proved by relatively standard techniques, and requires the 
following auxiliary lemma, whose proof is by induction on t — j-h u: 

Lemma 6 Let t G T\. If t — s^h u then t{x/r} — !>h u{x/r}. 

Lemma 7 (Projection of -o on ^-h) Lett G T. If p : t -^'^ u thent^^"^ u^andn = \p\dB < k. 
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Proof. By induction on k. If fc = it is trivial, so let fc > and t ^"^^ r ~o u. Let r be the 
reduction t r. By i.h. we get tj,— 5-™ r4,and m = |T|dB < fc — 1. Now consider r u. There 

arc two cases: 

• If r —o-is u then 7"^,= u^, by definition of as the normal form of — >s (which contains ^is). 

• If r — OdB u then r — H[{Xx.v)L s] ^ i7 [t;[a;/s]L] = u. We prove that 7'^— >h ^1, from which it 
follows that ij,-^™^^ u\, where m + l = \t\^^ + 1 = |p|dB < k. 

By induction on H . We use the following notation: lit^uL and L = [yi/iyi] . . . [ym/wm] then 
we write CTl for {yi/wi\\ . . . {ym/wmW-, thus we can write u\sj-l. Cases: 
• H =[■]. Then: 

rl = {{\x.v)h)isl = (def. of (-U) 

{\x.v\cj-l) si ->h 

■yifLla^/si} = a; ^ fv(it;j) for i G {1, . . . , m} 

vi{xls\^ui. = (def. of (Oi) 

(«[a:/s])>L = (def. of 

{v[x/s]h)i = 



• H ^ H' w. Then 



• i? = Ay.iJ'. Then 



H = H'[y/w\. Then 



= {H'[{\x.v)h s])iwi (*./i.) 
{H'[v[xls]L])iwi = Mi 



= \y.{H'[{\x.v)h s])l (i.h.) 
Xy.iW[v[x/s]L])l = ul 



ri = {H'[{Xx.v)L sMy/wi} {i.h. & 

iH'[v[x/.s]LMy/wi} = ui 



□ 



4.2 Projection of — t-ji on ^ 

Here we map head /3-steps to head linear steps followed by unfolding. In other words, we prove that 
head-linear reduction is not only a sound, but also a complete way to implement head reduction. 
This section is going to be technically more involved than the last one. First of all, we show that a 
single head step can be simulated by a step of — ^dB followed by unfolding, which is straightforward: 



Lemma 8 (Head Simulation) Let t be a X-term. Then t -T-h u then t ^dB^'s 

Proof. By induction on t — i-h u. The base case: if (Xx.u) r — u{x/r} then {Xx.u) r ^dB ^[a;/?'] 
and ^[x/r] — >■* u{x/r} by full composition. The inductive cases follows by the i.h.. □ 

We are now going to show that a sequence —^l^ can be factored into some head-linear substitutions, 
followed by non-head-linear ones. This allows to refine Lemma [5] Define =>s as the relation 
— \ — °is, i-c. =>s reduces non- head-linear substitution redexes. Moreover, define the linear 
unfolding tl of t as the normal form of t with respect to (which exists since ^is^^-is and 
— ^is terminates, and it is unique because — ois is deterministic). We also need the following 
abstract lemma about postponement of reductions: 

Lemma 9 Define — >i,2 as — !>i U -^2- Then: 
1- If ^*2^iQ^*i^*2 then ^l^C^l^l. 
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2. ( Geser-Di Cosmo-Piperno) If ^2 ^''^d, -^i is strongly normalising then — 5'2^'i'^^'i'^2, 

and so — >* 2^^i^2- 

Proof. 1. By induction on the number k of ^-i steps in t : < — j-J 2 The case A: = is trivial. 
Let fc > 0. Then if t is not of the form of the statement it has the foUowing form: 

t^l^t ^1^*1.2 " 

Using the hypothesis we get 
The i.h. on — !'2^i 2 gives: 
and we conclude. 

2. We prove the first consequence, the second one follows from the first and Point 1. Let r : 
t -^2^1 -By induction on the pair {r]{t),k), where r]{t) is the length of the maximal ->i 
reduction from t, ordered lexicographically. The case fc = is trivial, then let fc > 0. Now, if 
r =^2^^ ^2^1^!^^ then by applying the first hypothesis to the central subesequence we get 
T C— >-2^^— ^2^']*^^. The measure of the prefix -^2~^~^t {v{t)^k ~ 1) hence by i.h. we 
get r ^2^1^^. For the suffix — >2^2^i^^ ^hc first component of the measure 

which decreases, since its starting term is obtained through a — >j'"-reduction from t, so one can 
apply the i.h. and get r C— >-^— >-2. 

□ 



t ^t^^^-2^1,2 

t -^*i^l^*i^*2 U 



Now, we can prove that any — step is simulated in ~°dB^is^l (actually, in such a sequence 
there can be at most one step): 

Lemma 10 (Unfolding Factorization) The following swaps hold: 

1. =^s^isC-o]'3=^>+, precisely: =^s-^isC^is^s U ^is^is^s U ^i3=>s=>3. 

2. ^'is^^is^^s; '^'^'^ particular t — tj, t\. 

Proof. 1) Formally, the proof is by induction on t =>s u, see [2] for the technical details (and 
more generally for a detailed study of redex permutations in A[.]). Informally, cannot create, 
duplicate or alter the head nature of an — ois-step, so that the second step in t =>s^is ^ can be 
traced back to a unique — ois redex. Now, there are two cases: 

1. The two redexes simply permute. 

2. The preponement of — ois duplicate the redex reduced by =^>s. Two subcases: 

1. One of the two residuals is a linear head redex. For instance, consider the diagram: 

x[x/y][y/z] =^s x[x/z][y/z] 

Als ^ ils 

y[x/y][y/z] -^is z[x/y][y/z] =>3 z[x/z][y/z] 

2. Both residuals are non-linear-head redexes: 

x[x/yy][y/z] x[x/yz][y/z] 

ils ^ ils 

{y y)[x/y y][y/z] {y z)[x / y y][y / z] {y z)[x/y z][y/z] 

2) Apply Lemma|ni2 taking -H-i:=— ois, ^2:==^s and since -^is is strongly normalizing (from 
-^isC^-is and termination of — t-is) we get (^is U =>s)* Q~°is'^Ij and conclude since — t-is^^is 
U =^s- □ 

We know that a step is simulated by a sequence of the form ^dB^is=^s^^*=^s- Consider 
two (or more) — >-h-steps. They are simulated by a sequence of the form ^*^*^*=>*, while we 
would like to obtain ^*=>*. What we need to do is to prove that a sequence of the form ^*^dB 
can always be reorganized as a sequence -^^dB^^s- 
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Lemma 11 The following inclusions hold: 

1- =>s^dB^— °dB=>s- 

Proof. 1. By induction on t u, see [5] for the technical details. The idea is that =>s can- 
not create, duplicate nor alter the head nature of — OdB redexes, and so the — ^dB-step can be 
preponed. Conversely, ^dB-steps cannot erase, duplicate nor alter the non-head nature of =>s 
redexes, therefore the two steps commute. 

2. Let t =>s^dB u- The proof is by induction on k using point 1, and it is a standard diagram 
chasing. 

□ 

The next lemma is the last brick for the projection. 
Lemma 12 //tj,— >h u then there exists r s.t. t^ — OdB r -^1^ 

Proof. By lemma [8] we get t^^— By lemma [TOl t — j-^^ tj, factors as t —o^^ t^ =>* t\, and so 
we get t[ =>l^^ls u. By lemma [TTfe] we get t[ -^^1^1^ u, i.e. t[ -^^{^ u. □ 

We can then conclude with the result which gives the name to this section: 

Lemma 13 (Projection of — on ^) Lett he a X-term. Ift —5-^ u then there exists a reduction 
p s.t. p : t —o* r, with r -^'[^ u and \p\is, = k. 

Proof. By induction on k. The case fc = is trivial, so let A: > and t — 5-^ "'^ s — u. By i.h. 
there exists a reduction r s.t. r : t —°* v, v -^^^ s and |r|dB = fc — 1. Since s is a A-tcrm we have 
that ^4,= s and wj,— )-h u- By lemma fT2l there exists r s.t. uj, — o^jg r — u. Moreover, v —o^^ vl; 
call 7 this reduction. Let p be the reduction obtained by concatenating r : t —o* v, "f : v -o^^ ''^i 
and vl -^dB We have |p|dB = I'^'IdB + 1 = k and r -^^^ u, and so we conclude. □ 

4.3 Quadratic Relation 

The last two sections proved that head reduction can be seen as head linear reduction followed 
by unfolding, and that the number of head steps is exactly the number of multiplicative steps 
in the corresponding head linear reduction. To conclude that head and head linear reduction 
are polynomially related we need to show that the number of exponential steps in a linear head 
reduction p is a polynomial of the number of multiplicative steps in p. 

We do it by giving a precise estimate of the maximal length of a — reduction from a given 
term. Intuition tells us that any reduction t — u cannot be longer than the number of explicit 
substitutions in t (number noted es(t)), since any substitution in t can act at most once on the 
head variable. However, a formal proof of this fact is not completely immediate, and requires to 
introduce a measure and prove some (easy) lemmas. 

The idea is to statically count the length of the maximum chain of substitutions on the head, 
and to show that this number decreases at each head linear substitution step. Let us give an 
example. Consider the reduction: 

t ^ {x y)[x/y r][y/u\ {{y r) y)[x/y r][y/u\ -o^s {{u r) y)[x/y r][y/u] 

It is easy to establish statically on t that [y /u] will give rise to the second ^is-stcp, since y is the 
head variable of y r, which is what is going to be substituted on the head variable of t, i.e. [y/u] is 
an hereditary head substitution of t. We use this idea to define the measure. Note that, according 
to our reasoning, [y/u] is an hereditary head substitution also for s — {x y)[x/{y r)[y/ii]], but we 
get around these nested cases because we only have to deal with shallow terms. 
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Definition 3 Hereditary head contexts are generated by the following grammar: 



HH := H\ HH[x][x/H] 
The head measure \t\HH of a shallow term t is defined by induction on t: 

\x\hh - \t[x/u]\HH = \t\HH lft^HH[. 

\\x.t\HH = \t\HH \t[x/u]\HH = + l otherwise 

\t u\hH = \t\HH 

Please notice that \t\HH = for any A-term t. We need a lemma about _ff_ff-contexts. 
Lemma 14 Let t be a shallow term. 

1. If \t\HH = and t = HH[x] then the context HH[-] is an head context. 

2. If t is shallow, t —°\s u, t = HII[x] and IIH[-] does not capture x, then there exists HHq s.t. 
u = HHi:,[x\ and HHq does not capture x. 

Proof. 1. By induction on t. If i = r[y/u] the hypothesis \t\HH ~ implies that r ^ HH[y] 
and |r|^f^f =0. li t = HH[x] for some x then HH[-] = HHQ[-][y/u] for some hereditary head 
context HHq, and r = HHo[x]. By i.h. HHq is an head context, and so is HH. All other 
cases are straightforward. 

2. By induction on t -^is u. There are two interesting cases: 

• Base case: t = H[y][y/r\ —o\s H[r\[y/r] = u. Then r = i?o[-'J^] a-nd u = H[HQ[x\][y / r\. We 
conclude by defining HHq := H[HQ][y/r\. 

• Substitution inductive case: t = rly/v] —°is s[y/v] ~ u because r — s. There are two 
subcases: 

• t = HH[x\[y/v\ (and x ^ y). Then by i.h. exists HHq s. t. s ~ HHq[x\ and so we 
conclude taking HHq := HHQ[y/v]. 

• t = HH[y][y /v] and v = H[x]. Then by i.h. exists HHq s. t. s = HH^ly] and so we 
conclude taking HHq := HH[)[y][y/H]. 

This concludes the proof. □ 

Next lemma proves that \t\HH correctly captures the number of ^is-reductions from t, what is 
then compactly expressed by the successive corollary. 

Lemma 15 (| ■ \hh decreases with — ois) Let t be a shallow term. 
L t is a —oj^^ -normal form iff \t\HH = 0. 

2. t -^is u implies \t\HH ~ \u\hh + 1- 

3. \t\HH > implies that t is not -^\s-normal. 

Proof. 1. By induction on t. The only interesting case is if t = u[a;/r]. Then u is a ^is- 
normal form and so by i.h. we get = 0. By hypothesis u ^ i?[a;], otherwise t would not 

be ^is-normal. By Lemma [14111 if u = HH[y] then HH is an head context. Since for any 
term v there is exactly one head context Hq s.t. v — Hq[s] and s is a variable, we get that 
HH = H and u ^ HH[x]. By definition of | • \hh we get \t\HH = \u\hh = 0. 
<;=) By induction on t. The only interesting case is if t = u[x/r]. The hypothesis implies that 
\u\hh = and u ^ HH[x\. In particular, u ^ H[x] and -[x/r] does not give an ^ig-redex. 
By i.h. u is a — oig-normal form. Then t is a — o^^g-normal form. 

2. By induction on t — u. Cases: 

• t — H[x\[x/r] —Ois H[r][x/r] ~ u. Since x is free in H[x\ we get that H[x\ is a ^i^-normal 
form, and by Point [T] = 0. It follows that \t\HH = 1- The hypothesis that t is 
shallow implies that r is a A-term and the hypothesis ff[a;][a;/r] i/[r][a::/r] implies that 
H does not capture any free variable of r. Then H[r] is a — o^^g-normal form and by Point 
n \H[r]\HH = 0. By lemma [HT] if H[r] has the form HH[y] for some y then HH[-] is an 
head context, and so y is a variable of r. Since x ^ f v(r) we get x ^ y and H[r] has not 
the form HH[x]. Hence |7?[r] [x/r] = \u\hh = 0, and \t\HH — \u\hh + 1- 
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• t = Xx.r -^is A.T.S = u. Using the i.h.. 

• t = r V -ois s V = u. Using the i.h.. 

• t — r[x/v\ — s[2;/?;] = u. By i.h. \r\HH = \s\hh + 1- The measures and are 
given by |r|/f^f and |s|/f/f , respectively, plus the eventual contribution of [a^/w]. By lemma 
I14I2I either [x/v] contributes to both \t\HH and or it contributes to none. In both 
cases we get \t\HH = \u\hh + 1- 

3. By induction on t. The base case t = x is trivial, and the cases t = Xx.u and t = u r follows 
from the i.h.. The only interesting case is when t = u[x/r]. If \u\hh > then we conclude 
using the i.h.. Otherwise, it must be that u = HH[x]. By Lemma [141 II iJg is an head context 
and so t = H[x][x/r] -^is H[r][x/r] and we conclude. 

□ 

Summing up, we get: 

Corollary 16 (exact bound to ^is-sequences) t iff n ~ \t\HH. 

Proof. By induction on n. For n = the statement is given by Lemma [15111 Then let n > 0. 
Direction if t — ois u — Ois^^ tl then by i.h. \u\hh = n — 1. By Lemma I15I2I we get = n. 

Direction ■<=: by Lemma I15I3I t is not — ois-normal and so t —°i_s u. By Lemma I15I2I we get 

\u\hh = n — 1 and by i.h. u — ^iV^ = ^ii and so t -^^s ^i- ^ 

Now, we are ready to prove the quadratic relation. The following lemma is the key point for the 
combinatorial analysis. It shows that if the initial term i of a reduction p : t —o^ u is a A-term 
then |M|_f/_H is bounded by the number of steps in p (noted |/o|dB)- 

Lemma 17 Let t e 7a- If P ■ t u then \u\hh < es(u) = |p|dB- 

Proof. Note that by definition of | • \hh we get \u\hh < es(u) for any term u. So wc only need 
prove that es(M) = |p|dB- By induction on k = \p\dB- If A: = then p is empty, because f is a 
A-term and so it is — oig-normal. Then t = u and es{u) = 0. If fc > then p = t; -Od^', ~^Ts ^r 
some m and some reduction r. Let r be the end term of r and s the term s.t. r -OdB s — u. By 
i.h. es(r) = |r|dB = |p|dB — 1- Now, es(s) = es(r) + 1 = |p|dB, because each — OdB-step creates an 
explicit substitution. It is easy to see that ^is-steps do not change the number of substitutions in 
a term (i.e. es(.s)): by lemma[3]we get that any box subterm of s is a box-subterm of t, and since 
t is a A-term, the duplication performed by a ^is-step does not increase the number of explicit 
substitutions. Therefore, es[u) = es(s) = |p|dB- D 

We finally get: 

Theorem 18 Lett eT\. If p-t u then n = 0{\p\l^). 

Proof. There exists fc £ N s.t. p = ri; 71; . . . ; r^; 7^, where is a non-empty — OdB-reduction and 
7i is a — ois-reduction for i G {1, . . . , fc} and it is non-empty for i £ {l,...,fc — 1}. 
Let ri,...,rfe be the end terms of Ti,...,Tfe, respectively. By Corollary [TOl < \rj\HH and by 
Lemma[I7]|rj|//H < J2ie{i,....3} I'^'l- ^O'^ l/^ldB = J2ie{i fc} I'^'l bounds every \rj\HH, hence: 

hi\< \ri\HH < k ■ \p\dB 

ie{l,...,k} ie{l,...,k} 

But fc is bounded by |p|dB too, thus E,;G{i,...,fe} < IpldB and n < \p\l^ + \p\dB = 0{\p\l^). □ 
Putting together the results from the whole of Section |4j we get: 

Corollary 19 (Invariance, Part I) There is a polynomial time algorithm that, given t G 7\, 
computes a term u such that uj, = r if t has -^t^-normal form r and diverges if u has no — >h- 
normal form. Moreover, the algorithm works in polynomial time on the derivation complexity of 
the input term. 
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One may now wonder why a result like Corollary 1191 cannot be generalized to, e.g., leftmost- 
outermost reduction, which is a normalizing strategy. Actually, linear explicit substitutions can be 
endowed with a notion of reduction by levels capable of simulating the leftmost-outermost strategy 
in the same sense as linear head-reduction simulates head-reduction here. And, noticeably, the 
subterm property continues to hold. What is not true anymore, however, is the quadratic bound 
wc have proved in this section: in the leftmost-outermost strategy, one needs to perform too many 
substitutions not related to any /3-redex. If one wants to generalize Corollary 1191 in other words, 
one needs to further optimize the substitution process. But this is outside the scope of this paper. 

5 A[.] as an Acceptable Encoding of A-terms 

The results of the last two sections can be summarized as follows: linear explicit substitutions 
provide both a compact representation for A-terms and an implementation of /3-reduction in such a 
way that the overhead due to substitutions remains under control and is polynomial in the unitary 
cost of the A-term we start from. But one may wonder whether explicit substitutions are nothing 
more than a way to hide the complexity of the problem under the carpet of compactness: what 
if we want to get the normal form in the usual, explicit form? Counterexamples from Section [21 
read through the lenses of Theorem [T9l tell us that that this is indeed the case: there are families 
of A-terms with polynomial unitary cost but whose normal form intrinsically requires exponential 
time to be produced. 

In this section, we show that this phenomenon is due to the A-calculus being a very inefficient 
way to represent A-terms: even if computing the unfolding of a term t G A[.] takes exponential 
time, comparing the unfoldings of two terms t,u G A[.] for equality can be done in polynomial time. 
This way, linear explicit substitutions are proved to be a succint, acceptable encoding of A-terms 
in the sense of Papadimitriou |16j . The algorithm we are going to present is based on dynamic 
programming: it compares all the relative unfoldings of subterms of two terms t and u as above, 
without really computing those unfoldings. Some complications arise due to the underlying notion 
of equality for A-terms, namely a-equivalence, which is coarser than syntactical equivalence. But 
what is the relative unfolding of a term? 

Definition 4 (Relative Unfoldings) The unfolding tl^ of t relative to context C is defined by 
induction on C: 

Constraining sets allow to give sensible judgments about the equivalence of terms even when their 
free variable differ: 

Definition 5 (Constraining Sets and Coherence) A constraining set A is a set of pairs (x, y) 
of variable names. Two constraining sets A and B are coherent ( noted A ^ B) if: 

• (a;, y) G A and (x, z) G B imply y ~ z; 

• {y,x) e A and {z,x) G B imply y ~ z. 

Moreover, A is auto-coherent if A ^ A. Observe that ^ is not reflexive and that a constraining 
set is auto-coherent iff it is the graph of a bijection. 

The algorithm tests pairs (a, b) of terms. We assume them preprocessed as follows: the spaces 
of substituted, abstracted and free names of a and b are all pairwise disjoint and neither a nor b 
contain any subterm in the form c[x/d], where X ^ f v(cfl We also note S the set of substituted 
variables of both terms. The whole algorithm is built around the notion of an unfolding judgment: 



Any term can be turned in this form in polynomial time, by — >gc-normalizing it. 
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Positive rules: 

^■^^-^ {t,C[[-]r])Z(u,D[[-]s]) (r,C[t [■]])-(«, 



■ var 

{(^,y)} 



(x, C) ' ~ (y, D) (t r, C) ~ (u s, D) 

{t,C[E[x\[x/[-\\\)^{u,D) {t,C)?t{u,D[E[x\[x/[\]\) 

■ unf ; unf 



{x, C[E[x/t\]) ~ (n, D) {t, C) ~ (x, D[E[x/u\\) 

{t,C[\x.[-]]) ~ K JI[Ai/.[-]]) g t' (t,C[[-][x/r]]) ~ 



(Ax.t, C) "^^'-'"'^ (Aj;.«, D) {t[x/r],C) ^ (u, D) 



sub; 



(t,C[\x.[-]])^{u,D[\y.[-]]) yz.{x,z),(z,y)^v {t, C) ^ (t, D[l][x/r]]) 

■ A2 SUbr 



{Xx.t, C) ~ {Xy.u, D) {t, C) ~ {u[x/r] , D) 

Error generation rules: 

(f, C[\x\-]\) ~ (it, D[\y.[-]]) {3z ^ y.{(x, z)} &v)y {3z ^ x.{{z,y)} &v)yv = ± 

{Xx.t,C) ~ {\y.u,D) 
err^jQ erx^X x ^ S 



A3 



(Ax.t, C) ~ (n r, D) (« r, C) ~ (Ax.i, D) (x,C)i(nr,D) 

X is X is X is 

■ err^^A 



{Xy.t, C) ~ (x, D) {x, C) ~ (Aj/.t, D) (u r, C) ~ (x, D) 



Figure 2: Unfolding rules 



Definition 6 (Unfolding Judgments) Let P = (a, h) he a preprocessed pair. An unfolding 
judgement is a triple (t,C),v,{u, D), where v is either J- or a constraining set A, also noted 
{t,C) {u,D). The rules for deriving unfolding judgments are in Figure\^ An operation o on 
values is used, which is defined as follows: 

1. vow = vUwifv^J-=^w and v ~ w; 

2. vow = Lifv^L^iv and v ^ w; 

3. vow = J-ifv = J-orw~J-. 

The rules in Figure\^ induces a binary relation \Za,b on the space of pairs (of pairs) in the form 
{{t, C), (u, D)) such that C[t] = a and D[u] = h: (P, Q) Ea.f, [R, S) if knowing v such that P Q 
is necessary to compute w such that R ^ S . 



Lemma 20 The relation C* is a partial order, while C^j^ is a strict order. 

Proof. Let < the strict order on (N x N) x (N x N) defined as the product order of the stan- 
dard lexicographic order on N x N. Let ||C|| be the number of substitutions into which [•] is 
embedded inside C. Observe that reflexivity and transitivity of □* hold by definition. About an- 
tisimmetry, just observe that if ((t, C), {u,D)) b {{r,E), (s, P)),'thcn {{\\E\\, \E\), {\\F\\, \F\)) < 
{{\\C\\,\C\),i\\D\\,\D\)). ■ □ 

Lemma 21 For every P for a and for every Q for b there is exactly one v such that P Q. 

Proof. Let P = {t,C) and Q = {u,D). We proceed by induction on the relation □a.b, which is 
a strict order on a finite set, thus a well order. Let us distinguish some cases depending on the 
form of t and u: 

• If both t and u are variables not in iS, then we can apply rule var. 

• If both t and u are abstractions, then we can apply the inductive hypothesis to the pairs 
obtained by taking the bodies of these two abstractions and apply the inductive hypothesis. 
Depending on the (unique!) outcome, we can apply exactly one between Ai, A2 and A3. 

• If both t and u are applications, then we can apply the inductive hypothesis to the pairs 
obtained by taking the immediate subterm, and conclude observing the shape of rule @. 
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• If any of t and it is a substitution, then we can proceed as usual and then apply either rule 
sub; or sub^. 

• If any of t and u is a substituted variable, then we can again apply the inductive hypothesis, 
and the rule unf ; or unf 

• In all the other cases, we can easily conclude by observing the shape of the six rules err^®, 
err@A- errAx, err^x, err^,®, err©^:. 

This concludes the proof. □ 

Lemma 22 If P and Q are pairs for a and b (respectively) and P ri Q, then \v\ < \a\ ■ \b\. 

Proof. By induction on the structure of the proof that P ^ Q, it is easy to prove that if (x, y) S A, 
then X is free in a and y is free in b. □ 

Definition 7 (Unfolding Matrix) Let P = (a, b) be a preprocessed pair. An unfolding matrix 
M for P is bidimensional array with the following form: 

1. its rows are indexed by the pairs (i, C[-]) such that C[t\ = a; 

2. its columns are indexed by the pairs {t,C[-]) such that C[t] = b; 

3. the values of the matrix are either (possibly empty) constraining sets or _L or 

Basically, the unfolding checking algorithm simply proceeds by filling an unfolding matrix with 
the correct values, following the rules in Figure[2]and starting from an unfolding matrix filled with 



Definition 8 (Unfolding Checking Algorithm) Let P = (a, b) be a preprocessed pair. We 
define the following algorithm, which which be referred to as the unfolding checking algorithm; 

L Initialize all entries in M to ^; 

2. Ifn has no entries filled with then go to step^' 

3. Choose {t,C) and{u,D) such that K[{t,C)][{u, D)] = #, and such that n[Pi][Qi], ... ,n[Pn][Qn] 

are all different from ff, where (Pi, Qi), . . . , (P„, Qn) are the immediate predecessors of{{t, C), (w, D)) 
in \Za,b; 

4-. Compute V such that {t,C) {u,D); 

5. M[(i,C)] [(«,£>)] ^ v; 

6. Go to step\E- 

7. Return yes j/M[(a, [■])][(6, [■])] is a constraining set which is the identity, otherwise return no. 

It is now time to prove that the Unfolding Checking Algorithm is correct, i.e., that it gives correct 
results, if any: 

Lemma 23 Relative unfoldings verify: 

L If C does not contain an explicit substitution for x then x\^ = x. 



#: 



2. t[xlr]\c — ''^■lc[[-][x/r]y 



3. (Ax.t)4^ = Ax.(t4^[^^ [.]]). 

^- ^k;[E[x][x /[■]]] = ^ic[E[x/t]y 



Proof. All points are by induction on C. Relevant cases: 
• If C= [•], then: 




^iE[x][x/[-]] - ^; 
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• If C = Du, then: 

m 
m 
m 
II 

m 



liC ^ D[y/u], them 

m 
m 

m 

SI 

m 



„) '^Ud r] ''-^nlt [■]] 

= ^\d[[-\ r\ u) ^\D[t [■]] uV 

D[E[x][x /[■]]] u) = ^iD[E[x][x /[■]]] ^iD[E[x/t]] 
~ ^\D[E[x/t\] u)- 



= i.h. tiD[[-][x/r]]{yl^^ = ^'iD[[-][x/r]][y/uy 

= ^)4o{y/"i} r] ^4.D[t [.]]){?//<} 

= ^iD[[.] r]ly/u] ^lolt imv/uV 

^i.h. tiD[[-][x/r]]iy/'^-^ = *^Z5[[-][a;/r]][y/«]- 



This concludes the proof. □ 

Lemma 24 // (t, C) {u,D) and v ^ 1- then 7ri(u) = fY{tl^) and 7r2(w) = fv(u^^), where tti 
and 7r2 are the usual projections functions. 

Proof. By mduction on the derivation {t, C) ~ (m, D). The rule var is the only atomic case, and 
the statement trivially holds. The inductive cases: 

• unfi) By i.h. 7ri(w) = f v(t4^j^j^jj^^j jjj). Wc have to prove that ■ki{v) = f v(a;4^j^j^^^jj). Lemma 

gives tX^^E[x][x /[■]]] = ^\c[E[x/t]V and so we conclude. 

• unfr) As the previous case. 

• sub/) By i.h. i:i{v) ~ f v(t4^jj jj^^^jj). We have to prove that 7ri(w) = fv{t[x/r]\^). By lemma 

[lSIllt4c[[.][x/r]] ^ ^[^lAXc^ and so we conclude. 

• subr) As the previous case. 

• Ai) By i.h. 7ri(w) = fv(<^j^^ jjj). We have to prove that tti = fv{{\x.t)l^). By Lemma 
123131 Aa;.(i4^j^^ j.jj) = {\x.t)\^, and f v(Ax.(i4^j^^ j.jj)) = i^{t\c[xx.[-]]) \ i^} = ^i(^) \ {^^l- 
Similarly for (u, _D[A?/. [•]]). 

• A2) By i.h. 7ri(u) = fv(t4^j_^^ j jj). The reasoning is as in the previous point, except that the 
hypothesis (x, z) ^ v for all z implies that x ^ 7ri(w) and so iriiv) \ {x} = 7ri(i;). Similarly for 
{u,D[\y. [■]]). 

• @) By hypothesis vow ^ ±, which happens only if w 7^ ± 7^ w and u In that case 
vow = vyjw and so tti{vow) = ■ni{v) U7ri(w). Since v ^ ^- ^ w we can apply the i.h. to the 
hypothesis of the rule, and get iriiv) = fv{tl^^^-^ ^j) and 7ri(w) — r^^^^ j jj. We have to prove 
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that 7ri(?;) U 7ri(w) = fv{t r^^). By Lemma [23141 we get t rl^ ~ ^-^[[-J r] [■]]' ^^'^ 

conclude. Similarly for (m s,D). 

• For other rules ?; = _L, so there is nothing to prove. 

□ 

Definition 9 (Unifying Renaming) A constraining set v = . . . , {xk,yk)} is a unify- 

ing renaming for two X-terms t and u if v is a bijection of fv(t) and fv(w) such that: 

1. u{xi/yi} . . . {xk/vk} = t. 

2. t{yi/xi} . . .{yk/xk} = u. 

Lemma 25 // there is a unifying renaming between t and u then it is unique. 

Proof. By induction on t. Straightforward. □ 

Lemma 26 If{t,C) - {u,D) and w 7^ _L then v is auto- coherent, i.e. it is the graph of a bijection. 

Proof. By induction on the derivation (i, C) (u, D). For rule var it is obvious. For rules unf 
unfr, sub;, subj., Ai and A2 it simply follows from the i.h.. For rule @ the i.h. gives that v and w 
are two bijections. Now, vow 7^ _L implies that v ^ J- w and v w. Suppose that vow = vUw 
is not a bijection. This happens iS v ow is not a function or it is not injective, but since v and w 
are bijections both cases are absurd because oi v ^ w. □ 

Lemma 27 {t, C) - {u, D) and V auto-coherent implies v is a unifying renaming for t^^ and uj,^ . 

Proof. By induction on the derivation of {t,C) ^ {u,D), using lemma [521 Cases: 

• var) Immediate. 

• sub;) By i.h. A is a unifying renaming for i4t7[[][a;/r]] ^^^d- By lemmaHlwe get: 

Thus A is an unifying renaming for t[x/r]J^ and u^j-,- 

• subr) As the previous one. 

• Ai) By i.h. A U (x, y) is a unifying renaming for i4c[A£c [■]] ^^'^ '^■^D[\y [■]] ^^'^ tims by definition 
X e ivitlc[Xx.[-]]) ^'^'^ V ^■^("4i3[Aj/. [•]])■ lcmma[23l21wc get: 

{\x.t)i^ = Ax.(t^j^^ j.jj) 

And similarly {\y.u)i^ = \y .{ul^^^^ ^^^^) . Since fv(Ax.(t4^j^^ j.^j)) = i^{t\c[\x.[-]]) \ {^'} ^nd 
fv(A?/.(u4.^j^^ j.jj)) = f v(w|^j^y J jj) \ {y] we get that A is a unifying renaming for {Xx.t)l^ and 

• A2) By i.h. A is a unifying renaming for i4c[Ai:.[ ]] '^^D[Xy.[-]\ ^'^^ -2:), (z, y) ^ A,Vz. 
As in the previous case {\x.t)\^ = ^^.(tj^j^^ j.jj) and {\y.u)ljj = j.j]). By defi- 
nition of unifying renaming we get that x ^ ±v{t\^^^^ j jj) and y ^ fv(u4,^j^^ j jj), and since 
f^(^^[Ax.[.]]) = J.]])) and fv«^j^^ j.jj) = f v(Ay.«^j^y j.jj)) we get that ^ is a 
unifying renaming of {\x.t)l^ and (Ay.w)4,^. 

• @) By i./i. A is a unifying renaming for r] ^'^'^ ^^i'ld s]' ^^"^ -B is a unifying renaming 
for r^^jj j and s^.^^^^ j jj. By lemma [53111 we get 

'')~[c ^ *^[[.] r] ^^[t [■]] 

and 

{u s)ljj = s] ^^D[u [■]] 

Since fv((t r)4^) = fv(t4^jj.j ^j) U fv(r4^jj j.jj) (and analogously for (u 5)4.^) and A ^ B,we 
get that ^ U i? is a unifying renaming for {t r)]^ and (u s)4.£)- 
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• unfj) By i.h. A is a unifying renaming for i4c[£;[2:][2;/[ ]]] ^'^"^ ^''■i-D' lemma B3I5I we get 

*iclElx]lx/[-]]] = '-^IclElx/t]] 

And so A is a unifying renaming for xl^^^^^^^-^-^ and u^^. 

• unfr) As in the previous case. 

This concludes the proof. □ 

Lemma 28 (t,C) ^ {u,D) implies there is no unifying renaming for andu\.j-^. 

Proof. The relative unfolding of a lambda is always a lambda, and the relative unfolding of an 
application is always an application. Therefore for rules err^© and err@A the relative unfoldings 
have different topmost constructors and thus are different. For rules err^^;, err^j^, err^;® and 
eTT@x note that Lemma l23l 1 1 states that the relative unfolding of a non-substituted variable is a 
variable, thus again the relative unfoldings of the two terms have different topmost constructors 
and are different. For rule A3 if u = ^ then it follows from the i.h. and Lemma I23I3I 

Otherwise suppose that there exists z ^ y s.t. {{x,z)} G v. By Lemmas and B71 we 
get that V is an unifying renaming for i4c[Ax [■]] ^^'^ ^-^D[\y [•]]■ Moreover, by Lemma 1241 we get 
^i(v) = Mt\c^xx.[-]]) and Mv) = fv(w43j^^ j.jj). 

Now, by Lemma [23l3] we get (\x.t)\^ = Aa;.(<4^j^^ j jj) and {Xy.u)l^ = '^2^-("4-_D[Ay. [■]])• 
{{x,z)} e V we get that x e fv(t4^j^^ jjj). It is easily seen that in this case if there is an 
unifying renaming for {Xx.t)l^ and {Xy.u)]^jj it is necessarily obtained from an unifying renaming 
for u^jjf^^y [.J] and u^jj^^^y [.jj by removing the pair (x, y). We know that v is an unifying renaming for 
^-^D[Xy [■]] and '>J'-i-]j[xy [•]]• Lemma l26l it is a bijection. By hypothesis it contains the pair (a;, z), 
and so it cannot contain the pair (x, y). So there cannot be an unifying renaming for {Xx.t)\^ and 
(Ay.uU^. □ 

Theorem 29 (Correctness) The Unfolding Checking Algorithm, on input (a,b), returns yes iff 
ai=bl. 

This is not the end of the story, however — one also needs to be sure about the complexity of the 
algorithm, which turns out to be polynomial: 

Proposition 30 (Complexity) The Unfolding Checking Algorithm works in time polynomial in 
\a\ + \b\. 

Proof. The following observations are sufficient to obtain the thesis: 

• The number of entries in M is |a||6| in total. 

• At every iteration, one element of M changes its value from # to some non-blank v. 

• Step [5] can clearly be performed in time polynomial in \a\ + \b\. 

• Computing the predecessors of a pair P can be done in polynomial time, and so Step [3] can 
itself be performed in time polynomial in \a\ + \b\. 

• Rules in Figure [2] can all be applied in polynomial time, in particular due to Lemma [22] As a 
consequence. Step |4] can be performed in polynomial time. 

□ 



6 Encoding Turing Machines 

A cost model for computation time is said to be invariant if it is polynomially related to the 
standard cost model on Turing machines. In sections [3] and SI we proved that head reduction of 
any A-term t can be performed on a Turing machine in time polynomial in the number of /3-steps 
leading t to its normal form (provided it exists). This is proved through explicit substitutions. 
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which in Section [5] are shown to be a reasonable representation for A-tcrms: two terms t and u in 
A[.] can be checked to have the same unfolding in polynomial time. 

The last side of the story is still missing, however. In this section, we will show how head 
reduction can simulate Turing machine computation in such a way that derivational complexity of 
the simulating A-term is polynomially related to the running time of the encoded Turing machine. 
Results similar to the one we are going to present arc common for the A-calculus, so we will not 
give all the details, which can anyway be found in [4]. 

The first thing we need to encode is a form of recursion. We denote by H the term tt, where 
t = Xx.Xy.y{xxy). H is a call-by-name fixed-point operator: for every term u, 

Hu {Xy.y{tty))u u{Hu). 

The A-tcrm H provides the necessary computational expressive power to encode the whole class 
of computable functions. 

The simplest objects we need to encode in the A-calculus are finite sets. Elements of any finite 

set A = {ai, . . . ,a„} can be encoded as follows: [a^]^ = \xi Ax„.Xi . Notice that the above 

encoding induces a total order on A such that < aj iff i < j. Other useful objects are finite 
strings over an arbitrary alphabet, which will be encoded using a scheme attributed to Scott. Let 
S = {ai, . . . , an} be a finite alphabet. A string in s S S* can be represented by a value [s]^ as 
follows, by induction on the structure of s: 

\eY' = Acci Xxn-Xy.y [oir]^ = Xxi A.T„.Ay.a;,; [r] ^ 

Observe that representations of symbols in S and strings in S* depend on the cardinality of S. 
In other words, if s G E* and S C A, [s]^* ^ [s]'^*. 

Of course, one should be able to very easily compute the encoding of a string obtained by 
concatenating another string with a character. Moreover, the way strings are encoded depends on 
the underlying alphabet, and as a consequence, we also need to be able to convert representations 
for strings in one alphabet to corresponding representations in another, different, alphabet. This 
can be done efficiently in the A-calculus by way of a term j4C(I]) which append a character to a 
string (both expressed in the alphabet S) and a term CS{Y,,A) which converts a string s G S* 
into another string in A obtained by replacing any character in S — A by the empty string. AC{T,) 
works in time independent on the size of the input, while CS{T,, A) works in time proportional to 
the size of the argument. 

Lemma 31 Given a finite alphabet S, there are a term AC{Y,) and a constant G N such that 
for every a G S, every term t and every s G S*, there is n < such that AC{'E,)t\a']^ [^1^ 
tlau]^' . 

Proof. The term we are looking for is defined as follows: 

AC{J:) = Xy.Xa.Xu.aMi . . . M^uy 
where for any i, Mi = Xu.Xy.y{Xxi Xx^-^^.Xw.Xiu). Observe that: 

ACiJ:)t\a,]^\u^^' ^3 \a,^^Mi...M\^\u]^'t 

^fl M^lu^'^'t^ltlaM^'. 

In other words, can be set to be + 5. This concludes the proof. □ 

Given alphabets S and A, the function : S* — > A* is defined by stipulating that for every 
71 G N and every ai, . . . , a„ G S, G^iai ■ ■ ■ a-n) ~ ui . . . w„. where Ui is ai if Oi G A and Ui is e 
otherwise 

Lemma 32 Given finite alphabets S and A, there are a term CS(T,, A) and a linear function 
fc's^ : N ^ N such that for every m G S there is n < f^g^{\u\) such that GS{T,, A)t\u']^ — >^ 
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Proof. The term we are looking for is defined as follows: 

CS{^, A) EE H{Xx.Xz.Xu.uNi . . . N\^Nz), 

where for any i, 

— ( Xu.Xz.x{Xu.AC(T,)z\ai]'^u)u if £ A 
' ~ 1 Xu.Xz.x{Xu.zu)u otherwise. 

and N = Xz.z\e']'^'\ Let P, be N^{CS{J:, A)/x}. Then the thesis can be proved by induction on 
u as soon as f^'g is defined as /^^ (x) = a:(|S| + + 7) + + 5: 



CS{J:,A)\a,u 



s* 





[Xz.Xu.uPi 


...p^^^Nz)t\e^''' 




\erpi... 






Nt -^^t\e\ 


A* 




(Xz.Xu.uPi 


...P\^Nz)t[a,u]^' 




\a,u^^'Pi. 


■■P^Nt 




P^H^'t 





Now, if fli G A, then there are n < f^'g {\u\) and m < such that 

P^\u\^'t -^l CS{T.,A){Xu.ACiA)t\a,']'^u)\u']^' 
->JJ {Xu.AC{A)t\a,^^u)\gl{u)-\'^' 
AC{A)t\a,^^\gliu)^^' 

while otherwise there is ri < f'cs^{\u\) such that 

P,\u']^'t ^2 CS{i:,A){Xu.tu)\u'\^' 

^« {Xu.tu)\gl{u)^^' 
t\gl{u)^^' 

This concludes the proof. □ 

A deterministic Turing machine 7W is a tuple {T,,abiank,Q,qimUai,q final, S) consisting of: 

• A finite alphabet E = {ai, . . . , a„}; 

• A distinguished symbol auank G called the blank symbol] 

• A finite set Q = {gi, . . . , of states; 

• A distinguished state qmuiai G Q-, called the initial state; 

• A distinguished state qfi„ai G Q, called the final state; 

• A partial transition function S'.QxYi^QxTiX {^,— such that S{qi,aj) is defined iff 

qi 7^ qfinal- 

A configuration for is a quadruple in S* x S x S* x Q. For example, if 5{qi, Oj) = {qi, Ok, ), 
then M evolves from {uap,aj,v,qi) to {u,ap,akV,qi) (and from {e,aj,v,qi) to {e,abiank,akV,qi)). 
A configuration like {u,ai,v,q final) is final and cannot evolve. Given a string m G E* which does 
not contain any occurrence of atiank, the initial configuration for u is {e,abiank,u,qinitiai), while 
the final for u is {e, a biank,u, qfinal) ■ 

A Turing machine CS.abiank^QyqimUahq final, S) computes the function / : A* A* (where 
A C E and abiank is not in A) in time g : N — >■ N iff for every u G A*, the initial configuration for 
u evolves to a final configuration for f{u) in i7(|m|) steps. 

A configuration (s, a,v,q) of a Turing machine = (E, abiank,Q,qintttai,q final, S) is represented 
by the term \(u,a,v,q)']^ = Xx.x\u'']^' [a]^ \v']^' \q'\'^. 

We now encode a Turing machine M = i^,abiank,Q,qimUai,qfinai,S) in the A-calculus. Sup- 
pose E = {ai, . . . , a|5]|} and Q = {qi, . . . ,q\Q\}. The encoding of A4 is defined around three 
A-terms: 
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• First of all, we need to be able to build the initial configuration for u from u itself. This can 
be done in time proportional to |m| by a term I{A4, A), where A is the alphabet of u, which 
can be different from S. I(A1,A) simply converts u into the appropriate format by way of 
CS{A, E), and then packages it into a configuration. 

• Then, we need to extract a string from a final configuration C for the string. This can be 
done in time proportional to the size of C by a term J-{A4, A), which makes essential use of 

cs{j:,a). 

• Most importantly, we need to be able to simulate the transition function of A4, i.e. compute 
a final configuration from an initial configuration (if it exists). This can be done with cost 
proportional to the number of steps A4 takes on the input, by way of a term T{M)- The term 
T{A4) just performs case analysis depending on the four components of the input configuration, 
then manipulating them making use of ACCE). 

The following three lemmas formalize the above intuitive argument: 



Lemma 33 Given a Turing machine M. = CS^ahiankyQ^QinitiahlfinahS) and a blank-free alphabet 
A, there are a term2{A4, A) and a linear function f^'^ such that for every u G A*, there inn 
such that I{A4 , A)t\u]^ -^^ t\C~\-'^ where C is the initial configuration for u and n < f^'^{\u\). 

Proof. Simply, 

IiM,A) = Xx.Xu.CS{A,E){Xz.x{Xy.y\e]''' z \q\'^))u; 

Indeed, 

I{M,A)t\u^^' -^l CS{A,J:){Xz.t{Xy.y\e^''' \aMank^'' z M^))M'^' 
{Xz.t{Xy.y\e\^' \auank\^ z \q\'^))\uf' 
^hi(Ay.2/rel^* WMank}"" M^* \q\'^))\u\^' 

where n < f^^lu]. This concludes the proof. □ 



Lemma 34 Given a Turing machine A4 — abiank-, Q,qmitiai,<l final, S) and for every alphabet 
A, there are a term J-{A4,A) and a linear function f^'^ such that for every final configuration 
C forue A* there isn€N T{M,A)t\C]^ -^'^ t\u'\^\ where n < f^'^{\u\). 



Proof. Simply, 



J"(7W, A) = Xx.Xy.y{Xv.Xa.Xu.Xq.CS{J:, A)xu) 
f^^^{x) = f^cs^{x)+7 



Indeed, 



J'{M,A)t\C^^ -^l \C^^{Xv.Xa.Xu.Xq-CS{J:,A)tu) -^l CS{J:, A)t\u']^' 

^-t\u^^' 

where n < f'^'^\u\. This concludes the proof. □ 



Lemma 35 Given a Turing machine M. = i^,abiank,Q,qinitiahq final, S), there are a term T{M) 

fM 

IT 



and a linear function fj^ such that for every configuration C , 



if D is a final configuration reachable from C in n steps, then 7~(A^)i[C]^ ^J^' t\D~\-'^ whe 

fM 
IT 



m<f^{n); 



• the term T{M)t\C^^ diverges if there is no final configuration reachable from C . 
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Proof. T{M) is defined as 



H(Xx.Xz.Xy.y{Xu.Xa.Xv.Xq.q(Mi . . . A/|Q|)wavz)), 

where, for any i and j: 

Mi = Xu.Xa.Xv.Xz.a[Nl . . . Nf'^)uvz; 

Xu.Xv.Xz.z{Xx.xu\aff-v\q^'^) if qi = qjinai 

^ Xu.Xv.Xz.xz{Xz.zu\ak']^v\qi']'^) li d{qi,aj) = {qi,ak,i) 

= ^ Xu.Xv.Xz.uPI''' ...pI^^P^'^vz if (5(g,,aj) = ((7;,afc,^) 

Xu.Xv.Xz.vR^^'^ . . . R^^^^^R'^'^uz if 5{qi, aj) = {qi, a^, — !►); 

= Xu.Xv.Xz.AC{J:){Xw.xz{Xx.xu\a^]^w\qi]'^))\ak]^v 

P''"" = Xv.Xz.AC{E)iXw.xziXx.x\e^''\aMank^''w\ql^'^))\ak^''v 

r\'^ = Xv.Xu.Xz.AC{Y:){Xw.xz{Xx.xw\a.{\^v\qi\'^))\ak\^u 

i?'-^- = Xu.Xz.AC{l:){Xw.xz{Xx.xw\auank^^\e^'''\q{\'^))\akfu 

It is routine to prove the thesis. □ 

At this point, we can give the desired simulation result: 

Theorem 36 (Invariance, Part II) // / : A* — > A* is computed by a Turing machine At in 
time g, then there is a term U{M,A) such that for every u 6 A*, U{A4, A)\u]'^ — [/('f*)!'^ 
where n = 0{g{\u\) + 

Proof. Simply define A) = Xu.I{M,A){Xx.T{M){Xy.T[M,A){Xw.w)y))y)u. It is routine 

to prove the thesis. □ 

Noticeably, the just described simulation induces a linear overhead: every step of corresponds 
to a constant cost in the simulation, the constant cost not depending on the input but only on M. 
itself. 



7 Conclusions 

The main result of this paper is the first invariance result for the A-calculus when reduction is 
allowed to take place in the scope of abstractions. The key tool to achieve invariance are linear 
explicit substitutions, which are compact but manageable representations of A-terms. 

Of course, the main open problem in the area, namely invariance of the unitary cost model for 
any normalizing strategy (e.g. for the strategy which always reduces the leftmost-outermost redex) 
remains open. Although linear explicit substitutions cannot be directly applied to this problem, 
the authors strongly believe that this is anyway a promising direction, on which they are actively 
working at the time of writing. 
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